Abstract: In this paper we propose a new cooperative packet
transmission scheme that allows independent sources to superimpose
their packet transmissions over the air. Relay nodes are used, and
cooperative diversity is combined with distributed space-time block
coding (STBC). With the proposed scheme the participating relays
create a space-time (ST) code for the over-the-air superimposed
symbols that they receive locally, without any prior decoding step.
The advantage of the proposed scheme is that communication is
completed in fewer transmission slots because of the concurrent
packet transmissions, while the diversity benefit from the use of
the STBC results in higher decoding performance. The proposed
scheme does not depend on the STBC that is applied at the relays.
Simulation results reveal significant throughput benefits even in
the low SNR regime.

I. Introduction 

Improving the reliability and throughput of wireless services
has always been one of the main motivations of the wireless
communications research community. Towards these two objectives,
in recent years there has been significant interest in the idea of
allowing packets to interfere or be superimposed over the air. The
term physical layer network coding (PLNC) is also used for this
concept. With PLNC, interference is used constructively to maximize
system throughput, and in most PLNC-based systems it is controlled.
To be effective, PLNC needs some form of node cooperation. The first
works considered this form of cooperation with a view to maximizing
the throughput of point-to-point links with bidirectional traffic
0, 0, 0, but PLNC can have a more substantial impact on the
performance of more complex and even non-canonical wireless
networks |0J. The general idea in all these works is that wireless
signals are allowed to interfere or to be superimposed. The problems
that arise during the signal recovery process can be addressed with
pre-coding steps [5], decoding at the relay if possible 0, use of
a-priori knowledge at the receivers 0, 0, and specialized decoder
design 0, 0. One important result from the aforementioned works is
that the existence of a-priori knowledge, in the form of already
decoded packets, is the only way to increase the decoding
performance if more than two
signals interfere 0, 0. Furthermore, none of the aforementioned
schemes processes the mixed signal at the relays for further
performance improvements.

Fig. 1. System model for the cooperative scheme that employs distributed
Space-Time Superimposed Symbol Coding (STSSC). Different line styles
indicate transmissions in different time slots. For the channels drawn with
dashed lines, not all connections are depicted to avoid cluttering the figure.

A different class of works that employs relay processing, but 
on non-interfering signals, is the randomized distributed space 
time coding (R-DSTC) [8]. With this scheme sources transmit
one at a time, and then the relay nodes forward a random 
linear combination of the decoded packets. One disadvantage 
of systems that fully decode the signals at the relay is the 
low achieved rate. To alleviate this issue, the authors in 
developed space-time network coding. The notion of space- 
time cooperation in that work comes from the use of multiple 
relays and algebraic network coding across different packet 
transmissions. A similar idea that requires no decoding at the
relays, but also does not exploit any form of interfering
signals, was presented in [10]. In that work
the main idea is that the transmitter sends a packet in one 
slot, while in a number of subsequent slots the relays encode 
their received signals into a "distributed" linear dispersion 
(LD) code, and then they transmit the coded signals to the 
destination node. 

In our effort to increase the throughput performance of 
wireless networks, and allow more nodes to superimpose their 
signals over-the-air, in this paper we propose a new form of 
distributed space-time-coded (STC) cooperation that is named 
distributed space-time superimposed symbol coding (STSSC). 
Our scheme aims at a generalization of distributed STCs for
the case of signals that are mixed or superimposed over the air.
The STC is created by linearly coding, across time, different
versions of the same superimposed symbols.

Fig. 2. Behavior of STSSC in the time domain. With STSSC the signal that
is forwarded during one symbol slot is a linear combination (dashed lines)
of all the superimposed symbols $q_r$ that were received during the broadcast
phase (grey ellipses).

II. System Model and Overview 

Our study focuses on the relay network model where a group
$\mathcal{S} = \{S_1, S_2, \ldots, S_N\}$ of sources wants to
communicate with a group $\mathcal{D} = \{D_1, D_2, \ldots, D_N\}$
of destinations with the assistance of a set
$\mathcal{R} = \{R_1, R_2, \ldots, R_M\}$ of relays. Every node has a
single omnidirectional antenna that is used for both transmission
and reception, and all nodes have the same average power
constraint. We denote the channel from the s-th transmitter to the
r-th relay as $h_{s,r}$, and the channel from the r-th relay to
destination d as $h_{r,d}$. We also assume that $h_{s,r}$ and
$h_{r,d}$ are independent complex Gaussian random variables with
zero mean and unit variance. Fig. 1 presents the topology that we
study in this paper, which includes the sources, the relays, and
the destinations. The transmission of a packet requires two hops
since we assume that there is no direct link between the sources
and the destinations. All the channels, from sources to relays and
from relays to destinations, are considered to be block-fading
Rayleigh. For each source/relay and relay/destination pair, the
channel is assumed to remain constant for the coherence period of
T symbols. Additive white Gaussian noise (AWGN) with zero mean and
unit variance is assumed at the relays and the destinations.
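
The channel assumptions above can be illustrated with a short Python
sketch. It is only an illustration under stated assumptions: the
function names `rayleigh_gain` and `draw_block_channels` are
hypothetical, and each link is reduced to a single scalar gain that is
drawn once per coherence block of T symbols.

```python
import random


def rayleigh_gain(rng):
    """Draw one complex Gaussian gain with zero mean and unit variance."""
    # Each real dimension carries half of the unit variance.
    sigma = 0.5 ** 0.5
    return complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))


def draw_block_channels(n_sources, n_relays, n_dests, seed=None):
    """Draw the gains that stay fixed for one coherence block (assumed scalar per link)."""
    rng = random.Random(seed)
    # h_sr[s][r]: source s to relay r; h_rd[r][d]: relay r to destination d.
    h_sr = [[rayleigh_gain(rng) for _ in range(n_relays)] for _ in range(n_sources)]
    h_rd = [[rayleigh_gain(rng) for _ in range(n_dests)] for _ in range(n_relays)]
    return h_sr, h_rd
```

Drawing the gains once per block, rather than once per symbol, is what
makes the fading block-fading in this sketch.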

To model the system more effectively we introduce the term
symbol slot, which is the basic time unit that we consider in this
paper and corresponds to the transmission time of a PHY symbol. A
transmission phase is a system-defined parameter that denotes the
time period in which a user (source or relay) can transmit a
packet; it consists of many symbol slots. The cooperative packet
transmission requires N transmission phases as defined previously.
In the first transmission phase, all the sources broadcast their
packets to all the potential relays. The relays do not try to
decode; instead they apply a distributed STC to the signals that
they receive locally in superimposed form, and then they forward
them to the destinations. Note that the key difference from existing
distributed STC schemes is that the information symbols are not
available at the relay for decoding, only their superimposed
versions. The superimposed signals can be coded with any type of
distributed STC (e.g. an orthogonal or LD code), and they are then
forwarded sequentially by the relays. The relay transmission order
can be random but it should be decided in advance of a
communication phase. At each destination the forwarded signals
from the relays are jointly ML decoded, and we assume that receiver
channel state information (CSI) is available.

III. The distributed STSSC Scheme 

In this section we analyze in detail the behavior of the proposed
cooperative scheme. But first we define the paper notation.
Matrices are denoted with bold capital letters, e.g. $\mathbf{A}$,
and vectors with bold lowercase letters. The matrices
$\mathbf{A}^T$, $\mathbf{A}^H$, and $\mathbf{A}^*$ are the
transpose, Hermitian, and conjugate of $\mathbf{A}$. The Euclidean
norm is denoted by $\|\cdot\|$ and $\mathrm{Tr}(\cdot)$ is the trace
of a matrix.

A. Source to Relay Transmission Phase 

Let $c_s[t]$ denote the symbol that source s wants to transmit
during the symbol slot t. The transmitted signal from all the
sources is normalized to have unit energy during the coherence
period of T symbols. The array of transmitted symbols is expressed
as the $N \times T$ matrix $\mathbf{X}_{\mathcal{S}}$. The energy
normalization process in this case corresponds to
$\mathrm{Tr}(E[\mathbf{X}_{\mathcal{S}}^H \mathbf{X}_{\mathcal{S}}]) = 1$,
in order to compare fairly systems with different N and T.
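
A minimal Python sketch of this normalization follows. It assumes a
hypothetical helper `normalize_block` and scales one realization of the
block so that its total energy is one, whereas the condition in the
text is stated on the expectation; the two coincide for
constant-modulus constellations such as BPSK or QPSK.

```python
def normalize_block(symbols):
    """Scale an N x T block (list of rows) so that Tr(X^H X) equals 1."""
    # Tr(X^H X) is the sum of the squared magnitudes of all entries.
    energy = sum(abs(x) ** 2 for row in symbols for x in row)
    if energy == 0:
        raise ValueError("cannot normalize an all-zero block")
    scale = energy ** -0.5
    return [[x * scale for x in row] for row in symbols]
```

With this scaling the total transmitted energy does not grow with N or
T, which is what allows systems of different size to be compared.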

During the first communication phase that is described here,
from symbol slot 1 to T, a source $s \in \mathcal{S}$ transmits the
signal $\sqrt{\rho}\,x_s[t]$ that is part of a larger packet. We
assume that $E[|h_{s,r}|^2] = 1$, and since the average signal power
is normalized to unity and the noise term has zero mean and
variance 1/2 per dimension, $\rho$ can be interpreted as the average
signal-to-noise ratio (SNR) at the receiver. With $w_r[t]$ we
denote the sample of the AWGN during symbol slot t, and
$\mathbf{h}_{\mathcal{S},r} \in \mathbb{C}^{1 \times N}$ is the
channel matrix that contains the N channel gains from the first
broadcast phase, i.e. from the group of sources $\mathcal{S}$ to
relay r:

$$\mathbf{h}_{\mathcal{S},r} = [h_{1,r} \;\; h_{2,r} \;\; \cdots \;\; h_{N,r}]$$

Now we can write in compact vector notation the received signal
during one symbol slot as

$$q_r[t] = \sqrt{\rho}\,\mathbf{h}_{\mathcal{S},r}\,\mathbf{x}_{\mathcal{S}}[t] + w_r[t],$$

where $\mathbf{x}_{\mathcal{S}}[t] \in \mathbb{C}^{N \times 1}$ is a
column vector of the matrix $\mathbf{X}_{\mathcal{S}}$. Note that at
this stage the relay does not have a decodable signal$^1$ and so it
cannot construct a STC for a specific symbol $x_s[t]$. Instead, the
relay constructs a STC for a superimposed signal that is a linear
combination of several symbols.
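
The received-signal model of this subsection can be sketched in Python
as follows. The helper names `superimposed_symbol` and `awgn_sample`
are hypothetical, and the sketch handles a single relay and a single
symbol slot.

```python
import random


def superimposed_symbol(rho, h_sr, x_t, noise=0j):
    """Return q_r[t] = sqrt(rho) * sum_s h_{s,r} x_s[t] + w_r[t]."""
    if len(h_sr) != len(x_t):
        raise ValueError("need one channel gain per source")
    # The relay only observes this sum, never the individual x_s[t].
    mixed = sum(h * x for h, x in zip(h_sr, x_t))
    return (rho ** 0.5) * mixed + noise


def awgn_sample(rng):
    """Complex noise sample with zero mean and variance 1/2 per dimension."""
    sigma = 0.5 ** 0.5
    return complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
```

For example, `superimposed_symbol(rho, h, x, awgn_sample(random.Random(0)))`
gives one noisy over-the-air superimposed symbol for gains `h` and
symbols `x`.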

B. Relay Operation 

With STSSC we design the transmit signal at every relay r as a
linear function of its received non-decodable over-the-air
superimposed symbol (OSS) $q_r$. This is done for each of the T
symbols in total. If an orthogonal STBC is used, or even a more
general linear dispersion (LD) code, the transmitted signal is
described by the following $T \times T$ matrix:

$$\mathbf{Z}_r = g_r \sum_{t=1}^{T} \left( \mathbf{A}_t\, q_r[t] + \mathbf{B}_t\, q_r^*[t] \right) \qquad (1)$$

Each column of this matrix is transmitted simultaneously from the
relay r. All the M relays transmit similarly in their corresponding
transmission phase. $\mathbf{A}_t$, $\mathbf{B}_t$ are the STC
matrices [11]. What this expression demonstrates is that a symbol
to be transmitted in a forwarding symbol slot is a linear
combination of all the symbols received in the previous T symbol
slots. This process is also clearly visible in Fig. 2, where the
creation process of the ST-coded superimposed symbols is depicted
graphically.
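
For the two-symbol Alamouti code, the construction in (1) can be
sketched in Python as follows. The dispersion matrices `A` and `B`
below are one common way to write the Alamouti code in linear
dispersion form and are an assumption of this sketch, as are the helper
names `relay_codeword` and `gram`; the source does not list the
matrices it uses.

```python
# Assumed Alamouti dispersion matrices for T = 2: A[t] and B[t] are 2 x 2.
A = [[[1, 0], [0, 0]], [[0, 1], [0, 0]]]
B = [[[0, 0], [0, 1]], [[0, 0], [-1, 0]]]


def relay_codeword(q, g):
    """Build Z_r = g * sum_t (A_t q[t] + B_t conj(q[t])) as a T x T list."""
    T = len(A)
    if len(q) != T:
        raise ValueError("expected one superimposed symbol per slot")
    Z = [[0j] * T for _ in range(T)]
    for t in range(T):
        for i in range(T):
            for j in range(T):
                Z[i][j] += g * (A[t][i][j] * q[t] + B[t][i][j] * q[t].conjugate())
    return Z


def gram(Z):
    """Return Z^H Z, a scaled identity when the design is orthogonal."""
    n = len(Z)
    return [[sum(Z[k][i].conjugate() * Z[k][j] for k in range(n))
             for j in range(n)] for i in range(n)]
```

With these matrices the codeword is g times [[q1, q2], [-q2*, q1*]],
so every forwarded entry depends on the superimposed symbols of both
slots, and `gram` returns g^2 (|q1|^2 + |q2|^2) times the identity.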

Let us elaborate the previous expression and re-write it as: 
At the destination, the received signal from one relay will then
be

$$\mathbf{y}_{r,d} = \sqrt{\rho}\, g_r\, \mathbf{h}_{r,d} \Big( \sum_{t=1}^{T} \sum_{s=1}^{N} \big( \mathbf{A}_t h_{s,r} x_s[t] + \mathbf{B}_t h_{s,r}^* x_s^*[t] \big) + \sum_{t=1}^{T} \big( \mathbf{A}_t w_r[t] + \mathbf{B}_t w_r^*[t] \big) \Big) + \mathbf{w}_{r,d}, \qquad (2)$$

where $\mathbf{h}_{r,d} \in \mathbb{C}^{1 \times T}$ contains the
channel gain during T symbol slots and it remains unchanged
according to our stated assumptions. The previous expression is
important since it demonstrates that with this formulation and
system design, we are able to express the signal at the receiver as
a function of the transmitted signal from the sources and not just
that of the superimposed signal at the relay. Thus, for each OSS
that will be transmitted, a new STC is created by forming a linear
combination of all the symbols received in the previous
transmission phase that has a duration of T symbols. The proposed
scheme can also be classified as distributed STSSC since the ST
code is applied in a distributed fashion by the relays, while it is
also based on ST coding of superimposed symbols.

$^1$The relay can decode with ML or an efficient sphere decoder, but
the high BER still makes this approach impractical.



The relay also applies power scaling so as to maintain the
power constraint. If $\sigma^2$ is the noise variance, then the
power scaling is given as

$$g_r = \sqrt{\frac{1}{\rho \sum_{s=1}^{N} |h_{s,r}|^2 + \sigma^2}} \qquad (3)$$

We can express in a more concise form the power amplification for
all relays as the following $M \times M$ matrix:

$$\mathbf{G} = \mathrm{diag}(g_1, \ldots, g_r, \ldots, g_M)$$
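
The scaling can be sketched in Python as below. The sketch assumes that
the gain is the amplitude factor that brings the average received power
at relay r, i.e. rho times the sum of |h_{s,r}|^2 plus the noise
variance, to unity; the helper names `relay_gain` and `gain_matrix` are
hypothetical.

```python
def relay_gain(rho, h_sr, noise_var=1.0):
    """Amplitude gain that scales the relay's average received power to one (assumed form)."""
    received_power = rho * sum(abs(h) ** 2 for h in h_sr) + noise_var
    return received_power ** -0.5


def gain_matrix(rho, h_all, noise_var=1.0):
    """Return the M x M diagonal matrix G; h_all[r] lists the gains into relay r."""
    gains = [relay_gain(rho, h_r, noise_var) for h_r in h_all]
    m = len(gains)
    return [[gains[i] if i == j else 0.0 for j in range(m)] for i in range(m)]
```

Because the gain depends only on the source-to-relay gains and the
noise variance, each relay can compute it locally without decoding.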

IV. Decoding 
The proposed scheme can work with an arbitrary LD code.
However, to keep the analysis simple and to demonstrate the main
concept of the decoding approach more clearly, we assume that the
ST code is orthogonal. Based on this choice, we present a new
decoding algorithm that combines the classic approach for decoding
general orthogonal designs with ML decoding of
superimposed/interfering symbols [11]. To proceed with the
description of the decoding process let us first define an extended
form of the signal that is received during one forwarding phase
from relay r and its complex conjugate as follows

The primary ML decision problem we desire to solve is equivalent
to minimizing the squared Euclidean distance metric [11]. From (0
and (ff) we have that this ML metric is

The decoding expression of (6) can be simplified to

At this stage, one of the key ideas of the decoding algorithm is to
employ matched filtering for each symbol that was transmitted by
each source. Thus, we have that the sufficient statistic we can get
for each symbol is

Due to the properties of orthogonal STBCs we have that

This last expression is important since it demonstrates clearly
that matched filtering for a symbol transmitted during slot t
decouples all the other symbols that were transmitted during any of
the remaining symbol slots. Thus, the decoder processing is linear
with respect to the number T of symbols transmitted from each
source (for each symbol slot t a new group of symbols is decoded)
but not with respect to the number of sources.
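
The decoupling property can be checked numerically for the Alamouti
code. The Python sketch below is a noise-free illustration under
assumptions: a single relay-to-destination path with a 1 x 2 channel
vector `h`, the codeword [[q1, q2], [-q2*, q1*]] built from two
superimposed symbols, and hypothetical helper names. It is not the
authors' decoder.

```python
def alamouti_receive(h, q):
    """Noise-free row vector y = h Z for Z = [[q1, q2], [-q2*, q1*]]."""
    h1, h2 = h
    q1, q2 = q
    y1 = h1 * q1 - h2 * q2.conjugate()
    y2 = h1 * q2 + h2 * q1.conjugate()
    return y1, y2


def matched_filter(h, y):
    """Return one statistic per slot; each depends on a single q[t] only."""
    h1, h2 = h
    y1, y2 = y
    u1 = h1.conjugate() * y1 + h2 * y2.conjugate()
    u2 = h1.conjugate() * y2 - h2 * y1.conjugate()
    return u1, u2
```

Each output equals (|h1|^2 + |h2|^2) times one superimposed symbol, so
the slots decouple. Each superimposed symbol still mixes all N sources,
which is why a joint search over the sources remains.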

After obtaining the sufficient statistic $u_{s,t}$ for each specific
information symbol $x_s[t]$, decoding cannot proceed based only on
this information. Without interfering/superimposed symbols, this
would normally be the case. Now, the algorithm must account for the
fact that $u_{s,t}$ contains the impact of all the other symbols that
were transmitted in the specific symbol slot t. To proceed with the
final steps of the decoding algorithm we define the following
scalar quantity in (IT) :

$$v_{s,t} = \sum_{r=1}^{M} g_r^2\, |h_{s,r}|^2\, \|\mathbf{h}_{r,d}\|^2 \qquad (8)$$

The decoding proceeds as follows. The sufficient statistic for each
symbol s, t is calculated with matched filtering, and then the
metric for each symbol is obtained from (0 as

$$e_{s,t} = \sum_{r=1}^{M} \|\mathbf{y}_{r,d}\|^2 - 2\sqrt{\rho}\, u_{s,t}\, x_s[t] + \rho\, v_{s,t}\, |x_s[t]|^2. \qquad (9)$$

To decode the concurrently transmitted symbols we employ ML
decoding. We write the ML rule for all sources as follows:

$$\hat{\mathbf{x}}_{\mathcal{S}}[t] = \arg\min \sum_{s=1}^{N} e_{s,t} \qquad (10)$$

Therefore, each group of symbols that are transmitted concurrently
in symbol slot t is simultaneously decoded with ML decoding and the
estimated result is the vector $\hat{\mathbf{x}}_{\mathcal{S}}[t]$.
This is another important objective of this algorithm, i.e. to
increase the diversity of interfered/superimposed symbols by
introducing ST coding and linear processing.
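
To make the joint decision concrete, the Python sketch below performs
an exhaustive ML search over all symbol tuples of one slot. It is a
generic illustration and does not evaluate the metric in (9): it
assumes a scalar observation `u` of the form sqrt(rho) times the sum of
`gains[s]` times x_s plus noise, with hypothetical effective gains and
a BPSK alphabet by default.

```python
from itertools import product


def joint_ml_detect(u, gains, rho, alphabet=(-1, 1)):
    """Exhaustively search all symbol tuples for one slot and return the best."""
    best, best_metric = None, float("inf")
    for candidate in product(alphabet, repeat=len(gains)):
        # Noise-free observation that this candidate tuple would produce.
        predicted = (rho ** 0.5) * sum(a * x for a, x in zip(gains, candidate))
        metric = abs(u - predicted) ** 2  # squared Euclidean distance
        if metric < best_metric:
            best, best_metric = candidate, metric
    return best
```

The search visits len(alphabet)^N candidates per slot, which reflects
why the decoder cost is not linear in the number of sources.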

V. Performance Evaluation 

We implemented the proposed cooperative scheme and evaluated its
performance in terms of BER and throughput under different channel
conditions through Monte Carlo simulations. Simulation results are
presented for the point-to-point transmission mode (named Direct in
the figures) as a reference for all our results. We also
implemented the Distributed STC protocol, where transmissions
occur independently without being superimposed and are subsequently
decoded at the relay. The Distributed STC is then applied at the
relays and the transmitted signals are combined with maximum ratio
combining (MRC) and ML decoding at each destination [7]. Finally we
also evaluated the performance of a scheme named AF-OST, where
packets are superimposed in the broadcast phase while they are
sequentially amplified and forwarded for subsequent ML decoding at
the destination.

Fig. 3. (a) BER and (b) throughput results for two sources with an
Alamouti-type code with T = 2 and all $|h_{x,y}| = 1$.

Fig. 4. (a) BER and (b) throughput results for three sources with the
C3.4 code, T = 4, and all $|h_{x,y}| = 1$.

We present the averaged results for 2000 packet transmissions
that have a packet length of 1000 bits. The channel bandwidth is
20 MHz, while the same path loss model was used for all the
channels. Furthermore, we also assume that the noise over the
wireless spectrum is AWGN with a noise variance of $10^{-9}$ W/Hz at
every node/link. We also used a Rayleigh fading wireless channel
model. The channel transfer functions between the nodes vary
independently but they are characterized by the same average SNR
unless otherwise specified. Therefore, only the BER of one of the
destinations is plotted.
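
A Monte Carlo BER estimate for the Direct reference link can be
sketched as follows. This is a simplified illustration, not the
simulator used for the reported results: it assumes BPSK, redraws the
Rayleigh gain for every bit instead of once per block, and uses
unit-variance complex noise so that `rho` is the average SNR. The
function name `direct_link_ber` is hypothetical.

```python
import random


def direct_link_ber(rho, n_bits, seed=0):
    """Estimate the BPSK bit error rate of one Rayleigh-faded link by simulation."""
    rng = random.Random(seed)
    sigma = 0.5 ** 0.5  # standard deviation per real dimension
    errors = 0
    for _ in range(n_bits):
        bit = rng.choice((-1, 1))
        h = complex(rng.gauss(0, sigma), rng.gauss(0, sigma))
        w = complex(rng.gauss(0, sigma), rng.gauss(0, sigma))
        y = (rho ** 0.5) * h * bit + w
        # Coherent detection with receiver CSI: rotate by conj(h), take the sign.
        decision = 1 if (h.conjugate() * y).real >= 0 else -1
        errors += decision != bit
    return errors / n_bits
```

Sweeping `rho` over a range of values and plotting the returned
estimates gives a reference curve of the kind labeled Direct.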

A. Results for Two Sources 

The related results for two sources and two relays can be seen in
Fig. 3. For this case an Alamouti-type code was employed by the
relays. The Direct mode corresponds to the performance of the
point-to-point link. The first observation is that the Distributed
STC system performs similarly to a Direct transmission. This is
expected since the diversity gain that we obtain from the
Distributed STC in the $R \rightarrow D$ links is diminished by
decoding errors at the relay for the $S \rightarrow R$
transmissions. The AF-OST scheme performs very well, but only in
the high SNR regime. This is because of the noise amplification
that occurs at the relays; when the signal quality is good, the
joint ML decoding algorithm can have significant impact on
throughput. On the other hand, STSSC outperforms AF-OST even in the
lower SNR regime because of the collected diversity benefits from
the employed STC.

Fig. 5. (a) BER and (b) throughput results for four sources with the
C4.4 code, T = 4, and all $|h_{x,y}| = 1$.


B. Results for Three and Four Sources 

The selected code for three sources, named C3.4, is not a full-rate
code but it is orthogonal, while for four sources the code C4.4 is
a full-rate orthogonal code [11]. The related results can be seen
in Fig. 4 and Fig. 5. When compared with the N=2 system, the case
of N=3 with the Distributed STC already performs better since
another node is used for the STC. It now outperforms Direct
transmission even though the impact of decoding errors at the relay
still exists. On the other hand, AF-OST presents a minor
performance decrease for N=3 and an even more visible one for N=4,
which indeed shows that N=2 is the optimal choice for this
protocol. This is primarily due to noise amplification at the
relays, which increases as more of them are used. The more
encouraging results can be seen for STSSC, where performance is
significantly improved as the number of sources is increased, while
it also significantly outperforms the Distributed STC even in the
low SNR regime. This is an important result since at this point we
can observe a reversal in the performance trend when compared to
AF-OST, i.e. the BER is minimized when more sources are involved
and superimpose their signals. The reason for this behavior is that
the diversity benefits from the use of the STC significantly
increase the decoding performance of the joint ML detector at the
receiver. The key issue is that the same superimposed symbol, which
receives diversity equal to the number of relays with AF-OST,
receives with STSSC a diversity equal to the number of relays times
T since it is coded and re-forwarded from them at each symbol slot.



VI. Conclusions 

In this paper we introduced the concept of space-time 
superimposed symbol coding. We showed that wireless signals 
that are superimposed over the air can still reap the benefits of 
space-time coding if nodes in the wireless network cooperate. 
This is accomplished first by applying the space-time code 
not on the symbols of interest themselves, but on the non- 
decodable superimposed signal of several symbols, and second 
by shifting the decoding to the final destinations. Thus, relays 
are still less complex in the sense that they are not required 
to perform any decoding operation besides the STC creation 
and at the same time their presence is fully exploited. Per- 
formance results show that significant throughput benefits can 
be observed over a distributed STC scheme. Also significant 
performance improvement is observed in the low SNR regime, 
an area where cooperative PLNC schemes traditionally suffer 
due to noise amplification. 